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Abstract. We proposed in [T] a new approach to prove the metastable be- 
havior of reversible dynamics based on potential theory and local ergodicity. 
In this article we extend this theory to nonreversible dynamics based on the 
Dirichlct principle proved in 



1. Introduction 

Metastability is a relevant dynamical phenomenon in non-equilibrium statistical 
mechanics which occurs in the vicinities of first order phase transitions |17j . |10) . 
In the sequel of [151 [3 [5], we proposed in pQ a new definition of metastability 
for continuous time Markov chains which visit points, and we presented in the 
reversible case three simple conditions which ensure the metastable behavior of a 
chain. All three conditions are formulated in terms of the stationary measure and 
of the capacity between the metastable sets. 

This method has been applied successfully in [2] to prove the metastable behavior 
of the condensate in reversible zero range processes evolving on finite sets, in [SI [4] 
to examine the metastability of reversible Markov chains evolving on fixed finite 
sets, and in |12] 113] to investigate the scaling limits of trap models on random 
graphs. 

The proof of the metastable behavior of a reversible Markov chain presented in 
P] relies essentially on two ingredients. On the one hand, the potential theory of 
reversible Markov chains (6) [10] permits to express the capacity between two sets 
in terms of a variational problem, the so-called Dirichlet principle, and provides 
simple formulas for the expectations of time integrals, carried over time intervals 
ending at the hitting time of a set, in terms of the equilibrium potentials. On the 
other hand, the local ergodicity allows to replace a time integral of a function by 
the time integral of the average of the function over the metastable sets. 

We generalized recently in [11] the potential theory of reversible Markov chains 
to the non-reversible context by defining a capacity between two sets and by proving 
a Dirichlet principle which involves, in contrast with the reversible double 
variational formula. 

In this article, we extend the theory elaborated in [1] to nonreversible dynamics 
aiming to examine the metastable behavior of the condensate in asymmetric zero 
range processes evolving on a one-dimensional finite torus [14J . The main result, 
stated in Theorem 12.11 below, asserts that the metastable behavior of a sequence of 
continuous time Markov chains follows from the same three conditions formulated 
in the reversible case. 
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One of the conditions, assumption (HO) below, requires the average jump rates 
of the trace process to converge. While in the reversible case the average rates 
can be expressed in terms of capacities, [TJ Remark 2.9], in the nonreversible case 
such formula is not available. We may, however, formulate a double variational 
problem whose optimal solution is equal to the average rate. This is the content of 
Proposition 12.21 

It may seem that the characterization of the average jump rate provided by 
Proposition 1 2 . 2 1 is deprived of interest and impossible to use in concrete examples. 
This is not the case as shown in [TJ] where we apply this result to obtain the 
asymptotic value of the average jump rates. This is done in two steps. We first 
compute the limit of (|2.5|) as N t oo and derive the optimal value for / on the set 
£- N - We then show that if the supremum in (|2.5p is carried over functions h which 
are not equal to the optimal value for / on E N , the limit of (|2.5|) is strictly smaller 
than the one obtained before. 

The last main result of the article, Proposition 12.31 asserts that the adjoint 
of a Markov chain which exhibits a metastable behavior is also metastable and 
its asymptotic dynamics is the adjoint of the asymptotic dynamics of the original 
process. 



2. Notation and Results 

Fix a sequence (En : N > 1) of countable state spaces. The elements of En are 
denoted by the Greek letters 77, £. A sequence of states q = (r) N E En : N > 1) is 
said to be a point in a sequence A of subsets of En, A = (An C En ■ N > 1), if 
r] N belongs to An for every N > 1. 

For each N > 1 consider a matrix Rn ■ En x En — > M such that Rn(v,0 > 
for 77 ^ £, — 00 < Rn(Vi V) < and X^s-En -^(77, £) = f° r au V e ^n- Denote by 
Ln the generator which acts on bounded functions / : En — > K as 

(L N f)( v ) = ]T Rn(v,0 - f(v)} ■ (2.1) 

Let {77^ : t > 0} be the minimal right-continuous Markov process associated to 
the generator Ln- It is well known that {77^ : t > 0} is a strong Markov process 
with respect to the filtration {J 7 ^ : t > 0} given by — <r(i]^ : s < t) [9| I16j. 
We assume throughout this section that for each N > 1 the process {77^ : t > 0} is 
irreducible and positive recurrent. 

Denote by fj, the unique stationary probability measure and let X N (rj) = 
—R(rj,i]), rj E En, be the holding rates of the process 77^. We assume that the 
holding rates are integrable: 

J2 M'yWM < 00, vw > 1 . (2.2) 

Denote by I?(K+, En) the space of right-continuous trajectories with left limits 
endowed with the Skorohod topology. Let , 77 E En , be the probability measure 
on D(M. + , En) induced by the Markov process {77^ : t > 0} starting from 77. Ex- 
pectation with respect to is denoted by and we frequently omit the index 



METASTABILITY OF CONTINUOUS TIME MARKOV CHAINS 



3 



Denote by Ha (resp. Hj[), A C En, the hitting time of (resp. return time to) 
the set A: 

H A := inf{f >0:))fsA}, 

H\ := inf{t > : r?f G A , rtf ^ r$ for some < s < t} . 

Denote by L 2 (hn) the space of square integrable functions / : En — > M, 
J2neE N fiWrPNiv) < °°i endowed with the usual scalar product. Let L* be the 
adjoint of L in L 2 (/in)- The operator L* corresponds to the generator of a Markov 
process, denoted by : t > 0}, whose rates are represented by R N . Denote by 

P* = P^' N , r\ G En, the probability measure on D(BL+, En) induced by the adjoint 
Markov process starting from r\. Expectation with respect to P^' N is represented 
byE^ 

Fix a finite number of disjoint subsets E N , . . . , E N , n > 2, of En- E n CiE v n = 0, 
x ^ y. Let S — {1, . . . , k}, En = ^xesE N and let An — En \ En so that 

E N = E N U • • • U E N U A N = E N U A w . (2.3) 

Let £^ := £^ \ £^, 

£ = (£jv : N > 1) , E x = (E X N : N > 1) and E* = (8.% : N > 1) . 

Denote by {% N : t > 0} the trace of {rtf : t > 0} on E N [L. Let i?^ : 
£at x £^r — > R + be the transition rates of the trace process {r]f N : t > 0}, and 
denote by tn{E n , E y N ) the mean rate at which the trace process jumps from E X N to 

r N (E N ,E N ) := — ±— ]T MV) E R n(v,0 , (2-4) 

and let 

For two disjoint subsets A, i? of 23jv, denote by cap Ar (A, B) the capacity between 
A and £?, defined in 11 by (|6.7|) for irreducible positive recurrent Markov processes. 
For a point £ T = (£**: N > 1) in let 

capjv(4) = inf c&p N ({v}AC}) ■ 

Recall the notion of tunneling behavior and recall properties (Ml), (M2), (M3) 
introduced in Definition 2.2 of pQ. 

Theorem 2.1. Suppose that there exists a sequence 6 = {9n ■ N > 1) of positive 
numbers such that, for every pair x,y G S , x ^ y, £/ie following limit exists 

r(x,y) := lim ^^(6^,8^). (HO) 

Suppose that for each x € S, there exists a point £ x = : N > 1) in E x such 
that 

lim CaPjv(£ ^ £ ^ ) = . (HI) 
jv^oo capj V (f a .) 

Then, for any points {£ x *E E x : x £ S}, properties (Ml) awe? (M2) o/ tunneling 
hold on the time-scale 6, with metastates {E x : x G S 1 }, metapoints {C x ■ x G 5} 
and asymptotic Markov dynamics characterized by the rates r{x,y), x,y G 5. 
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If in addition we have that (M3) holds for each x € S which is an absorbing 
state of the Markov dynamics on S determined by the rates r and that 

lim ^i^) = o (H2) 

for each non-absorbing state i£S, then property (M3) holds for every x G S. 

In contrast with the reversible case, we are not able to express the average rates 
rjv(£^r, E V N ), x, y £ S, in terms of capacities. We may, however, formulate a 
variational problem whose optimal solution is equal to the average rate. 

Proposition 2.2. Fix x =/= y in S and consider the variational problem 

infsnp{2(f,L N h) llN - (h, {-L N )h) m }, (2.5) 

/ h 

where the infimum is carried over all functions f : En — ¥ K which are equal to 1 on 
E V N , on £jy \ (£^ U E V N ), and which are constant on E N , while the supremum is 
carried over all functions h : En — > R which are equal to on En \ (E N U £- v N ) an d 
which are constant over E V N , E X N , with possibly different values at each set. Then, 
the optimal function h associated to the optimal function f is such that 

r N (E%,E x N ) 



This result is a particular case of Proposition l4.2l below. We conclude this section 
with two remarks on metastability. The first one concerns the metastable behavior 
of the adjoint of processes which exhibit a metastable behavior, while the second 
one points out that condition (HO) alone guarantees the metastable behavior of 
the collapsation of the trace process. 

Proposition 2.3. Suppose that a sequence of Markov process {r/^ : t > 0} satisfies 
the hypotheses of Theorem \2.1l that /j,n(E n ) converges to a strictly positive limit 
m{x) for each x € S and that Markov dynamics on S determined by the rates r 
has no absorbing states. Then, m is a stationary probability measure for this as- 
ymptotic dynamics, the adjoint process {r]^' N : t > 0} also satisfies the assumptions 
of Theorem \2.1\ and the limiting jump rates introduced in condition (HO) for the 
adjoint process, denoted by r*(x,y), are the adjoint rates of the asymptotic dynam- 
ics with respect to the measure m. In particular, the adjoint process also exhibits a 
tunneling behavior. 

Proof. Denote by r N (E N ,E N ), x ^ y e S, the mean jump rates of the trace on £jv 
of the adjoint process. By definition of m and r, m {y) r {yi x ) IS equal to 



lim ji n (E n )6n rN(E v N ,E x N ) = nm / ) Vn(E n ) 6n r%(E N , £ 



N—too L — J N— >oo ' 

y^x yjtx 



where the last identity follows from the explicit expressions for the jump rates of 
the trace processes presented in [TJ Proposition 6.1]. By (|6.8I) and (|6.10l) . 

where cap^ represents the capacity with respect to the adjoint process. By 
Lemma 2.3] and by formula (2.4) in that article, cap^(£^,£^) = cap Ar (£^ r , £^) 
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so that 

lim y^u, N (E y N )9 N r N (E N ,£, N ) = lim hn(£-n) Y] °n r N (£ N , £ at) > 

y=£x y^x 

that is, J2y=£x m (y) r (yi x ) = m ( x )J2y=£x r ( x 'y)' proving that m is a stationary 
measure. 

To prove the second assertion of the proposition, note that condition (HI) is 
satisfied because the measure and the capacities of the adjoint process coincides 
with the measure and the capacities of the original process. 

On the other hand, as we have seen above, 

for all x ^ y in S. Hence, by condition (HO) for the original process and by 
assumption, 

lim e N r* N (£ x N ,£ y N ) = ^f^-r(y,x) =: r*(x,y) . 
iV->oo m(x) 

This proves condition (HO) for the adjoint process and shows that the dynamics 
on S induced by the rates r sr (x,y) has no absorbing point since 

^2r*(x,y) = ^r(x,y) > 

y^x y=£x 

by assumption. In particular, condition (H2) for the adjoint process is in force. □ 

Recall that we denote by {?7 t £jv : t > 0} the trace of the process -q^ on £at. 
Denote furthermore by {?yf : t > 0} the Markov process on S = {1, . . . , k} obtained 
from r/f N by collapsing each set £^ to the point x [IT]. The jump rates of the 
Markov process rif , denoted by rff(x, y), are given by 

r ff( x >y) = r N(£ N ,£ N ) > x^yeS. 

In particular, under assumption (HO) the speeded up Markov process 77^ con- 
verges to the Markov process on S characterized by the rates r(x,y). 

3. Proof of Theorem 12.11 

We prove in this section the main result of the article. Given two disjoint subsets 
A,B C E N , let 

f AB (v) := [H A <H B ], f* AB (v) ■= P; [ H A < H B } . 

In addition, for point 77 = (r) N : N > 1), £ = (£ N : N > 1) in £, rj N ^ £ N , and a 
set A = (A N : N > 1), set 

/at(»7=€) = /{>}{?"} ' fN(v,A) = f{ v N }AN , cap N (v,$.) = cap^-fr?^}, {£ N }) , 

and a similar notation without the upper index *. 

For two subsets A, B of En, denote by Ha(B) the time the process 77^ spent 
on the set B before hitting the set A: 



H A (B) = / l{ryf €B}ds 
Jo 
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Fix x G S. By the definition of H^n(E.%) given just above, by (|6.13|) below, and 
by identity (|6.8[) for the capacity, 



E, 



rjv(£Ar, £at) E^jv [iJew (£^)] 



) (i{£^}, fN{v,€))n> 



Since < f^{v,0 < 1, and since ( i?jv( • , £^) 1{£^} ) MJV = n N {E%)r N (8.^, £^), 
by identity (|6.8p again, we obtain the following estimates which will be used re- 
peatedly below 



E, 



< 



cap w (£^ r , £^r) 



capwd. 



rAr(£^,£^)E^jv[^«(£^)] < 



cap w (£^, £^-) 



(3.1) 



cap;v(£x) 

Lemma [3~T1 below is Proposition 5.10 of [T|. The proof needs some modifications 



to take into account the lack of reversibility. Let R 



*,£jv 
N 



£jv x £ 

*,N 



N 



be 



the transition rates of the trace on £at of the adjoint process r) t ' . Denote by 
r^r(£^, £jy) the mean rate at which this trace process jumps from £^ to £^: 

and let R*/ N (rj, E y N ) = E^ee^ r n &N (v>Oi V + x ^ 

* (ox px \ _ * icx cV \ 

y^x 

Lemma 3.1. Fix x € 5. Assume condition (HI) for some point £ x = (£^ : N > 1) 

in E x . Then, for every point £ = (£ N : N > 1) in £ x , 

cap Ar (£^ r , £^) _ 



lim 



(3.2) 



lim inf P*[F f w <Hx„] = 1 , lim inf PJH t s < H* x } = 1 . (3.3) 

Proof. Fix x £ S and some point £ x — : N > 1) in t x for which (HI) holds. 
In view of (|6.9p . by p. II) for the adjoint process and by assumption (HI), for every 
point r/ — (j] N : N > 1) in E x , 

cH t 



lim E* 



AT— >oo 



7? 



*.£jv (^*,N px 
N 







iV 



lim r^(£^,£^)E; N [F e (£^)] = 



The assumptions of Jfl Proposition 5.1] are thus fulfilled for the adjoint process and 
with W = £*, £ c = £* and the point £ x = : N > 1). 

Fix an arbitrary point £ = (( N : N > 1) in E x . We show that conditions (i) and 
(ii) of [TJ Proposition 4.2] are in force for the adjoint process and with W = £ x , 
■B c = £ x , £ = C 
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Fix a point rj = {if : N > 1) in £ x . By (|Q3]) applied to rj N , g = 1{£^}, 
= ti N }, and to C N , 9 = B = {( N }, 

E,>[ff c »(£^)] < e;„[Hs»(£ x n )} +E* n [H c ,(^ n )} 

< Vn(£n) 1{ti n ^Z n ] I ^ n ^n) ir^^^j 
cap N (r] N ,£, N ) c&p N (( N ,£ N ) 



< 



NJ 



cap w (£iv) 

It follows from this estimate, identities (16.8[) . (|6 .9[) and assumption (HI), that 
lim r^(£^,£^)E,>[ J ff c »(£^)] - 0. 

Therefore, applying the assertion (5.1) of [TJ Proposition 5.1], to W = E x , T> c = £ a 



I™ [f^Sli 1 = J™ ^(£^,£^)E;„[ff CN (£^)] = 
Jl, „[iig x (£at)J JV^oo 



" ,' iJNi 

This result shows that condition (i) of [TJ Proposition 4.2] holds. 

Fix a point 77 = (77^ : N > 1) in E 31 . By [TJ Proposition 5.1], under P* N , 
i?gx (£^,)/E* JV [TJg^ (£^)] converges to a mean one exponential distribution, which 
is condition (ii) of [TJ Proposition 4.2]. 

By items (i) and (ii) of Proposition 4.2 we conclude that (£ x , £. x , £) is a valley 
for the trace on £^r of the adjoint process. Hence, for any point ij = (77 : N > 1) 
in £ x 

which implies condition (VI) for the triple (£ K , £, £) because clearly {H^n(En) < 
-ffjx (£iv)} £ {-ff^w < iJg x } P* N -a.s., proving the first assertion in (|3.3[) . The 
proof of the second one is exactly the same replacing the adjoint process by the 
original one. 

The arguments presented in the first part of the proof, applied to the original 
process instead of the adjoint process, show that the assumptions of [TJ Proposition 
5.1] are in force for the original process and with W = E x , 23 c = £ x and the point 
£ x = : N > 1). Fix a point C = (£ N ■ N > 1) in £ x . By (|fU3)l with A = {( N }, 
B = t x N and g — l{£^r}, and by identity (|6.8j) . the limit (5.1) in TJ Proposition 
5.1] can be re- written as 

m (Z x N ) ca p N (( N ,Z x N ) 

By (031), the infimum of £ x ) over £^ converges to 1 as N "f" 00. Therefore, 

(|3.2|) follows from this observation and the previous identity. □ 



Proof of Theorem \2.1\ The proof of this result is similar to the one of [TJ Theorem 
2.7]. We first need to check that all assumptions of [TJ Theorem 2.4] are in force. 

By (|3.1j) . conditions (CI) and (C2) in [TJ Theorem 2.4] follow from the stronger 
condition (HI). Therefore, by [TJ Theorem 2.4], properties (Ml) and (M2) of 
tunneling hold on the time-scale 6, with metastates {E x : x S S}, metapoints 
{£ x : x £ S} and asymptotic Markov dynamics characterized by the rates r{x 1 y), 
x,y e S. The estimate ()3.3j) shows that we can replace the metapoint £ x by any 
point C x m & x ■ 
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We turn to condition (M3). Fix x G S and t > 0. Decomposing the time interval 
[0, f0/v] according to the successive visits of the trace process r\ t N to the metastates 
£. Z N , z G S, we see that to prove condition (M3) for all x G S 1 , it is enough to show 
that this condition is in force for each absorbing state x for the Markov dynamics 
on S determined by the rates r and that 

limsup sup -— EjJifg* (Ajv)1 = 

holds for each non-absorbing state x. 

Fix a non-absorbing state x G S. By (|6.13|) and since /*- is bounded by 1, 

the expectation is less than or equal to /Ltjv(Ajv)/capjv(?7, E N ). By Lemma 13. 11 we 
may replace asymptotically r\ by E N in the previous ratio. Since cap(£^,£^,) = 
\in(£-n) t n(£-n ■> £.%), we have shown that 

limsup sup -j-~E v [H$ x (An)] < limsup = — 7 ^TcFT ' 

at-s-oc v ez* N Vn n n-kx 6 N r N (t x N ,?, N ) Hn{c n ) 

As a; is a non-absorbing point, by assumptions (HO). (H2), the right hand side 
vanishes, which concludes the proof of the theorem. □ 



4. A FORMULA FOR THE AVERAGE JUMP RATE 

Consider an irreducible chain {r)(t) : t > 0} on a countable state space E. Let 
B C F be two non-empty subsets of E. Denote by {77 (t) : t > 0} the Markov 
process on (E \ B) U {0}, where i) E is an extra pointed added to represent the 
collapsed set B, obtained from r](t) by collapsing the set B to a point [IT], and 
denote by {r] T (t) : t > 0} the trace of the process r](t) on the set F [JJ. Denote 
furthermore by {r] T (t) : t > 0} the trace of the Markov process r) c (t) on the set 
G = (F \ B) U {£)}, and by {r] CT (t) : t > 0} the process 77 (t) where the set £? 
has been collapsed to a point D. Note that the state space of both processes is 
G=(F\B)U{d} 

Lemma 4.1. The processes {r] T (t) : t > 0} and {r] (t) : t > 0} have the same 
law. 

Proof. Both processes are Markov processes on G — (F \ B) U {5}. To prove the 
assertion it is therefore enough to check that the jump rates are equal. 

Denote by Rc, Rt, Rct, Rtc the jump rates of the Markov processes ri c (t), 
i] T (t), rj CT (t), n TC (t), respectively. Let 77 G F \ B. We show that R TC (Q,r)) = 
Rct(Q, v) an d leave the other identities to the reader. 

By [1, Proposition 6.1], R T c(D,v) = -M^P^c = where P ? stands 

for the probability measure on the path space D(R+, (E \ B) U {£>}) induced by 
the Markov process r) C (t) starting from 0. The probability P%'[Hq = can be 
written as the sum of the probabilities of all paths connecting to 77 through points 
which do not belong to G: 

PC[H+=H+] = J2pc&&---Pc(U-i,v) , 

7 

where the sum is carried over all paths 7 = (£0 = 3, £1 , . . . , £ n —i> £n — v) such that 
£j • G, 1 < j < n — 1 or, equivalently, such that ^- e E\F . In this formula, pc 
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stands for the jump probabilities of the collapsed chain. Hence, 

Rrcfav) = '£ f Rc( 1 t>,Zi)Pc(Zi&)---Pc(€n-i, i n)' 

7 

By equation (3.3) in [ITJ, pc(Z,Q = P&C) if 6 C G E \ B and JZcfoft) = 
/j(i?) _1 J2(eB A t (C)-R(Ci The previous expression is thus equal to 

^ ' CeB 7 

On the other hand, by equation (3.3) in [IT] and Proposition 6.1 in [1 , for 

RcT ^ = ■JmT.^ORACv) = -^5>(C)A(C)Pc[i^ = ^ + ] . 

This proves the lemma. □ 

In the next result we consider an irreducible Markov process {77(f) : f > 0} on a 
finite state space E. Let Aq, A\, B be three disjoint subsets of E and let Rt denote 
the jump rates of rj T (t), the trace of the process 77(f) on the set F = A U A\ U B. 
We denote by Tt(B, A4), i = 0, 1, the average rate at which the trace process jumps 
from B to A^. 

r T {B,Ai) = -J- M(»?) E R tM - • 

Proposition 4.2. Consider the variational problem 

infsup{2(/,L%-(/i,(-L)/i) M }, (4.1) 

/ h 

where the infimum is carried over all function f : E —> R which are constant at 
each set Aq, A\, B, equal to 1 at A\ and equal to at Aq, while the supremum 
is carried over all functions h which are equal to at A and which are constant 
over A\, B, with possibly different values at each set. Then, the optimal function 
h associated to the optimal function f is such that 

V ' r T (B,A )+r T (B,A 1 ) 

Proof. Denote by Lq the generator of the Markov process 77° (f) on Eb = (E\B)U 
{0}, where the set B has been collapsed to a point D g 1 E. For a function g : E — > R 
constant over B, denote by ~g : Eb — > R the function which is equal to g at E \ B, 
and such that g(U) = g{B). 

Denote by ~p the probability measure on Eb equal to /i on E \ B and such that 
/Z(D) = n{B). A calculation, performed below equation (3.7) in [TT] , shows that for 
any pair of functions f,h:E—>M. constant over B, 

</,L% = <7,L C %. (4.2) 

Conversely, given /, h : Eb — > R, if we define /, h : E — !> R to be equal to /, h on 
E\B and such that f(rj) = /(0), h(r)) = h(d), 77 G B, @3) holds. 
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It follows from (|4.2j) that the variational problem ()4.1|) is equivalent to the vari- 
ational problem 

Mswp{2{f,L c h) 1 T - {h,{-Lc)h)Tx}, 

f h 

where the infimum is carried over all functions / : Eb — > R which are equal to 
1 at A\ and equal to at Aq, while the supremum is carried over all functions 
h which are constant over Aq, A\. This variational problem corresponds to the 
variational problem for the capacity between A\ and Aq for the Markov process 
rj c (t). By [Til Theorem 2.4], the optimal function h : Eb R. associated to the 
optimal function / is the harmonic function h{rj) = P < ^[Ha 1 < Ha ]- In particular, 
h(j>) = Pg [Hai < Ha ] and the optimal function h : E — > R associated to the 
optimal function / for the variational problem (|4.ip is such that 

h(B) = pC[H Al <H Ao ] . (4.3) 

Decompose the event {Ha x < Ha } according to whether < Ha x or H$ > 
Ha x and use the strong Markov property to get that 

per- . H i P C [gA 1 <min{g+,g Ao }] 

The denominator can be written as P^[Ha x < mm{H£ , H A „}] + Pj?[H Ao < 
mm{H^ , Ha 1 }]• Taking the trace of the process r\ c (t) on the set Ao U A\ U {0}, we 
get that 

v c [n . n i = Pl c [H Al <mm{H+,H Ao }] 

° L Al A ° J Pl C [H Ao < mm{H+,H Al }] +^[H Al < min{H+,H Ao }] ' 

By Lemma 14.11 we may replace the probability P 5 by the probability Pj in 
this formula. Since P$ c [Ha < min{i?^" , if^ }] is equal to pct(Q, Ao), where per 
stands for the jump probabilities of the process r) (t), multiplying the denominator 
and the numerator by Acr(f), the previous ratio becomes 

ggr(Mi) 

Rct^^Aq) + R C t(Q,A 1 ) ' 

By [U Proposition 6.1] and since the stationary measure for the trace process is the 
conditional measure, this expression is equal to 

KBr'E.eB^Rr^hA,) 

KB)- 1 £„ es Kv)Rt(v, A>) + KB)- 1 E v eB Kv)Mv, At) ' 

By definition of tt(B, Ai) we conclude that 

P [H Al < H Ao ] - rT(B;Ao)+rT(B;Ai) ■ 
This identity concludes the proof of the proposition in view of (|4.3[) . □ 

5. Birth and death process 

We prove in this subsection the metastable behavior of a class of birth and death 
processes which includes the zero range processes examined in [2] evolving on two 
sites. Proposition 15.11 presented below was used in [5] to prove the metastability of 
reversible zero range processes. 
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Fix a < b in R and consider a nonncgative smooth function H : [a,b] —> K+. 
Assume that H vanishes only at a finite number of points denoted by an < ai < 
■■■ <a m : 

H(x) = if and only if x € {an, . . . , a m } . 
We do not exclude the possibility that H vanishes at the boundary points a, b. 

For each i = l,...,m, assume that there exist a neighborhood V ai of ai and 
Qj > such that 

H(x) = \x - cii\ a > for all x e V ai , 

and that V ai n V aj = for i ^ j. Let a — max{a t : 1 < i < m}. Assume that 
a > 1 and that there are at least two exponents equal to a: 

k := \{i : a, = a}| > 2 , 

where \A\ indicates the cardinality of a finite set A. Denote by 61 < b 2 < ■ ■ ■ < b K 
the elements of {on, . . . , a m } whose associated exponents are a. 

The definition of the state space of the birth and death process requires some 
notation. Fix N > 1, 1 < i < m — 1, and let 

= min jfc > : a > on - (fc + 1)/a| , 
fcf = min{/c>0 : a, + (k + l)/N > a l+l - [k + l)/jv} , 
k% = min |fc > : a m + (k + 1)/N > &} . 
For 1 < i < m — 1 , set 

Gjv,o = {ai-A^/JV,...,ai-l/JV,ai}, 

G N , 2 = {^ + a, + 2/N, . . . , a, + fcf /JV, a J+1 - k? /N, . . . , a 4+1 } , 
G N , m = {a m + l/N,a m + 2/N,...,a m + k^/N} 

and let the state space E N be 1J™ G^,i- Note that the exact definition of E N is not 
important for the met a- stability behavior discussed in this section. The elements 
of E N are denoted by the letters x, y, z. Two points x < y are said to be neighbors 
in En if there is no z in En such that x < z < y. 

Let vn be the probability measure on En defined by 

1 1 



v N {x) = < 



Zn H(x) 

——N ai if x = ai for some 1 < i < m . 
Zn 



In this formula Zn is a normalizing constant. An elementary computation shows 
that 

Hm = + (5.1) 

i=l k>l 

where Oi = 1 if bi £ {a, b} and at = 2 otherwise. In particular, if we denote 

1 



m(bi) = 1 + for i = 



fc>i 

then 



N^>oo 
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for every 1 < i < k. 

Let {£n ■ N > 1} be a sequence of positive integers such that 1 <C In N, 

In 

lim — = and lim £k = oo , (5.2) 



and, for each bi, 1 < i < k, define 

E N :— En n 



In , In 

— , bi H 

TV ' N 



Since TV 1 ^^r — > 0, for AT large enough E N C VJ )i for every 1 < i < n. In particular, 
for N large enough, E l N C\ 8. J N = for all i ^ j. Moreover, since £tv — > oo, 

7Ti ( ' ) 

lim ^jv(Ajv) = and lim vn{£ 1 n ) = ^TTT ( 5 - 3 ) 

N-*oo N^,oo z2j=l m (bj) 

for all 1 < i < k, where A N := E N \ (Uf =1 E N ). 

Fix a positive function $ : [a, b] — > M+ bounded above and below by a strictly 
positive constant: 

< S < $(x) < (T 1 . 
This assumption is not needed but we do not seek optimal assumptions. Consider 
a birth and death process {Zf : t > 0} on En with rates given by 

!$(x) if x > 2/, 

i/iv(xj 

provided x and y are neighbors in • The process is of course reversible with 
respect to the measure vn- 

Proposition 5.1. The Markov process [Zf : N > 1} exhibits a tunneling be- 
havior in the sense of [1] Definition 2.2] on the time scale N 1+a with metastates 
E N , metapoints bi, 1 < i < k, and asymptotic Markovian dynamics on {1, ...,k} 
characterized by the rates 

1 1 

r(i, i + 1) = 



Ah) f^ +1 {H(u)/$(u)}du 



1 1 

r(i + l,i) = — T T—r , l<i<n. 

m(h+i) J£ +1 {H(u)/<S>(u)}du 

Proof. We show that the hypotheses of [TJ Theorem 2.7] are in force. Let £ N = bi 
for 1 < i < k. The asymptotic dynamics has no absorbing point and condition 
(H2) of [TJ Theorem 2.7] follows from 

To check conditions (HO), (HI) wc take advantage from the one-dimensional 
setting to get explicit expressions for capacities. For two disjoint subsets A, B of 
En, denote by cap Ar (^4, B) the capacity between A and B. When A = {a} we 
represent cap w (j4, B) by capjv(a, B) with the same convention for B. Let x < y be 
points in En- Recall that cap N (x,y) = DN(fx,y) where f x , y '■ En K solves the 
equation LNf x ,y{z) — for z $ {x,y} with boundary conditions f x ,y(x) = 1 and 
fx.y(y) — 0. An elementary computation gives that f(z) = 1 for z < x, f(z) = 
for z > y and 

f(z + 1/AT) - f(z) — + W 
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for z £ En n [x, y). Hence, 

cax> N (x, y) = — — r . (5.4) 

In last two formulae, there is a slight abuse of notation since En is not the set 
{z/N : z £ Z n [aN, bN]}, but the meaning is clear. In particular, if {x N : N > 1}, 
{y N : N > 1} are two sequences in En such that x N — > a' and y N —> b' for some 
a < a' < b' < b, by (|5T4]) . 



Jim N l + a c^ N {x N ,y N ) = ^ K 1 { / {#(«)/<&(«)} 1 > 0. 

(5.5) 

Denote by rjv(£}v, £^) the average jump rate from E l N to of the trace of the 
process Z^ on En = Ui< m < K £^, defined by (|2.4p in a general context. Clearly, 
tn{E n , £jy) = for \i — j'| > 1. Fix an arbitrary 1 < i < k and let G\ = U 3 -<j£^y, 
and G2 = Uy>j£^ so that 

^(Eiv)^^,^ 1 ) = „ N (b l + e f)R%(b l + e f,b l+ i- i f) 

= iy N (Gi)r N (Gi,G2) , 

where R N (x,y), x ^ y £ En , represents the jumps rates of the trace of Z^ on £jv- 
Therefore, by (|5TH|> . 

rp« pi+i\ _ l, N(Gi)r N (G 1 ,G 2 ) _ ca,p N (Gi,G 2 ) 
cap N (h + In /N, b l+1 - l N /N) _ 

Analogously, we obtain that 

fQi C i-is cap N ( b i ~£n/N, bi-i +£ N /N) 
r A r(£ A r,£ Ar ) = - 

V N{E N ) 

for any 1 < i < k. Therefore, by (15 ,5|) and (|5.3[) . 

lim iV 1 +«r J v(£^,£j+ 1 ) = -J— { [ ^ {H(u)/Z(u)} du 

and 

a^oo m(£»j+i) L J bi J 

for any 1 < i < k, which concludes the proof of assumption (HO). 
The same arguments show that 



l+a 

tap 

i r 1 . 1 1 (■••«.) 



lim JV 1+Q cap Ar (£ Ar ,£ Ar ) 

AT— >oo 



Ei<,< K m ft) l / b b / +1 {#(")/$(")} <fe /,J , _ i {ff(«)/$(u)}du- 

provided bi ^ a, b, with similar identities if = a or if 6^ =6. 
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It remains to check condition (HI). For any 1 < i < n and N large enough, 
H{x) = \x — bi\ a for all x G £jy. In consequence, by (|5.1|) and (|5.4j) . there exists a 
positive constant Co, independent of N, such that 

capjv(a;,6i) > — ^ , 
for any 1 < i < n and x G £]y. Therefore, since In *C AT, by (|5.6|) . 

r Cap^r (£]v i £jV") n 

hm sup ^"Hr - = 

ca,p N {x,bi) 

for all 1 < i < k, which concludes the proof of the proposition. □ 



6. Potential theory for positive recurrent processes 

We state in this section several properties of continuous time Markov chains used 
throughout the article. Consider a countable set E and a matrix R : E x E — > K 
such that R(r],£,) > 0, r\ ^ £, -oo < i?(?7, 77) < 0, J2^ R (v,0 = 0, T] e E. Let 
A(?y) = —R(r],r]). Since A(^) is finite and strictly positive, we may define the 
transition probabilities {p(rj^) : r/,£ G £"} as 

P(»7,0 = TT^^'O for '?^e, (6.1) 

and p(rj,rj) = for 7/ G _E. We assume throughout this section that {p(r],t;) : r/,£ G 
E} are the transition probabilities of an irreducible and recurrent discrete time 
Markov chain, denoted by Y — {Y n : n > 0}. 

Let {rj(t) : t > 0} be the unique strong Markov process associated to the rates 
R(r], £). We shall refer to i?(-, •), A(-) and p(- } •) as the transition rates, holding rates 
and jump probabilities of {r](t) : t > 0}, respectively. Since the jump chain Y is 
irreducible and recurrent, so is the corresponding Markov process {rj(t) : t > 0}. We 
shall assume throughout this section that rj(t) is positive recurrent. In consequence, 
rj{t) has a unique invariant probability measure /i. Moreover, 

M( v ) := X(v)^ V ) , V G E , (6.2) 

is an invariant measure for the jump chain Y, unique up to scalar multiples. The 
proofs of these assertions can be found in Sections 3.4 and 3.5 of [TB]. We assume 
furthermore that the holding rates are summable with respect to \i: 

J2Kv)Kv) < 00, (6.3) 

77G-E 

so that M is a finite measure. Assumption (16.3[) reduces the potential theory of 
continuous time Markov chains to the potential theory of discrete time Markov 
chains. 

Let L 2 (fi) , 1? (M) be the space of square integrable functions / : E — > M endowed 
with the usual scalar product (/,<?),« = J2 v eE f( r l)9( r l) m { r l)i with m = jj,, M, 
respectively. Denote by P the bounded operator in L 2 (M) defined by 



(6.4) 
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for / G L 2 (M), and by L the generator of the Markov process {rj(t) : t > 0}. Thus, 
for every finitely supported function / : E — >• M, 

Let L* be the adjoint of the operator L on £ 2 (/i). L* is the generator of a 
Markov process {?7*(i) : t > 0} with holding rates A* and jump rates R* given by 

A*(?7) = Afa), i?*(r?,0 = tit)Rfov)/tiv), 80 that »(V)R*(V,0 = KOR&V)- Let 
P* be the adjoint of P in L 2 (M). Clearly, the bounded operator P* is given by 
(|6.4|) with p* in place of p. Denote by {Y* : n > 0} the discrete time Markov chain 
associated to P*. 

Similarly, let S be the symmetric part of the operator L on L 2 (n). S is the 
generator of a Markov process {?7 s (i) : t > 0} with holding rates A s and jump 
rates R s given by A s (r/) = \(rjj, R s {r/,0 = (1/2)LR(?7, £) + P*(r/,£)}, so that 
f x( V )R s (r ] ,0 = f ji(Z)R°(Z,r l ). 

Let P^, P* P* 7/ e E, be the probability measure on the path space D(R + ,E) 
of right continuous trajectories with left limits induced by the Markov process 
{r](t) : t > 0}, {r]*(t) : t > 0}, {ri s (t) : t > 0} starting from 77, respectively. 
Expectation with respect to P,,, P*, P^ are denoted by E I( , E*, E*, respectively. 

Denote by Ha (resp. H\), A C E, the hitting time of (resp. return time to) the 
set A: 

H A = inf{i > : r](t) e A} , 

H\ = inf{f > : r)(t) e A,T)(s) ^ 77(0) for some < s < t} . 

Let F be a proper subset of E. Denote by {Tt ■ t > 0} the time spent on the set 
F by the process rj(s) in the time interval [0,i]: 

Tt ■■= f l{v(a) eF}ds. 
Jo 

Notice that Tt S K+, P r) -a.s. for every 77 S E and i > 0. Denote by {St ■ t > 0} the 
generalized inverse of Tt'- 

S t := sup{s > : 7i < t} . 

Since {77(f) : t > 0} is irreducible and recurrent, lim^oo Tt — 00, P^-a.s. for every 
77 £ E. Therefore, the random path {i] F (t) : t > 0}, given by r] F (t) = rj(St), is 
P^-a.s. well defined for all r) € E and takes value in the set F. We call the process 
{i] F (t) : f > 0} the trace of {77(f) : t > 0} on the set P. 

Denote by R F (r],^) the jump rates of the trace process {r] F (t) : t > 0}. By 
Propositions 6.1 and 6.3 in [T], {rj F '(f) : t > 0} is an irreducible, recurrent strong 
Markov process whose invariant measure fi F is given by 

» F (0 = jTps m , (ef. (6.5) 

For each pair A, B of disjoint subsets of P, denote by rp(A, B) the average rate 
at which the trace process jumps from A to B: 

r F (A,B) := X>fa) £ i^foO • 
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By [U Proposition 6.1], 

r F (A, B) = Yl M ^ P n [ h f = H % ] ■ (6-6) 

We shall refer to rp(-, •) as the mean set rates associated to the trace process. 

Recall from [IT] that the capacity between two disjoint subsets A, B of E, 
denoted by cap(A, B), is defined as 

cap(A B) := Y, Pn [ #s < H+ ] . (6.7) 

r)EA 

Hence, by (|6.6j) for any two disjoint subsets A, B of E, 

ca P ( A B) = fi(A) r AuB [A, B) . (6.8) 

Let cap* (A, B), cap s (A, B) be the capacity between two disjoint subsets A, B of 
E for the adjoint, symmetric process, respectively. By equation (2.4) and Lemmata 
2.3, 2.5 in p~TJ, 

cap*(A,B) = cap(B,A) = cap(A,S) , cap s (A,S) < cap(A,B) . (6.9) 

Denote by cap F the capacity with respect to the trace process ij F (t). By the 
proof of [JJ Lemma 6.9], for every subsets A, B of F, A n £> = 0, 

^( J F)cap F (A, J B) = cap(A-B)- (6.10) 

Next result presents an identity between the capacities cap s (A, B) and cap(A, B), 



somehow surprising in view of inequality (16.91) . 
Lemma 6.1. Let A, B, C be three disjoint subsets of E. Then, 
cap(A,BUC) + cap{B,AuC) - cap(C,AuS) 

= cap s {A,BuC) + cap s (B,AllC) - cap s (C, A U B) . 

Proof. Taking F = A U B U C in (|6.10|> . we may assume that A, C forms 
a partition of E. In this case, since cap(C, A U B) = cap(A U B, C) and since 
E = A U B U C, by (|6.8p the left hand side of the previous equation is equal to 



))GA r)£B r]£AUB 

= Yrtv)R(v,B) + Y^)R(v,A) = 2Yv(v)R s (v,B) , 

»76A 7/6-B ?7GA 

where R(£,D) = Yl^eD R{£' 0- Performing the computations backward with i? s 
in place of i? we conclude the proof. □ 

We conclude this section proving a relation between expectations of time integrals 
of functions and capacities. Fix two disjoint subsets A, B of E. Denote by /ab, 
Jab '■ E ^ M. the harmonic functions defined as 

f AB (v) ■■= P„ [H A < H B ] , f* AB (v) ■■= P.* [Ha < H B ] . 
An elementary computation shows that f ab solves the equation 

(L/)fa)=Q fieB\(iUB), 

/(r?) = 1 7 ? G A , (6.11) 
/(r?) =0 rj E B . 
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and that f* AB solves the same equation with the adjoint L* replacing L. Clearly, 
we may replace the generator L by the operator / — P in the above equation, and 
(|6.1ip has a unique solution in L 2 (M) given by f ab- 
Define the harmonic measure vab, v \b on as 

M(7 ? )P„[ff+<g+] M( V )P;[H+<H+] 
VAB{T}) = TMA^) ' "**(") = ^^B) 

Denote by E„ AB the expectation associated to the Markov process {i](t) : t > 0} 
with initial distribution vab- 

Proposition 6.2. Fix two disjoint subsets A, B of E. Let g : E — >• K be a /i- 

integrable function. Then, 



E„« 



g{r,{t))dt 



cap(A, B) 

where (•, -} M represents the scalar product in L 2 (/i). 



3.12) 



Proof. We first claim that the proposition holds for indicator functions of states. 
Fix an arbitrary state If £ belongs to i? the right hand the left hand side of 

(|6.12[) vanish. We may therefore assume that £ does not belong to B. In this case 
we may write the expectation appearing in the statement of the lemma as 



-1 



n— v ' 

where {Y n : n > 0} is the discrete time embedded Markov chain, {e„ : n > 0} 
is a sequence of i.i.d. mean one exponential random variables independent of the 
jump chain {Y n : n > 0}, and H_g the hitting time of the set B for the discrete 
time Markov chain Y n . By the Markov property and by definition of the harmonic 
measure v* AB , this expression is equal to 

1 X)5>^) p 4 Y «=£' ri < H 5 



= X(^(A,B) g |>W P ^ < = " < Hb] • 

We may replace the hitting time and the return time ii^, JTj by the respective 
times Mb, M a for the discrete chain. On the other hand, since rj and £ do not 
belong to B, the event {Y"o = V, Yn — C , n < Mb} represents all paths that 
started from 77, reached £ at time n without passing through B. In particular, by 
the detailed balanced relations between the process and its adjoint, M(rj) P n [Y n — 
£, n < M B ] — M(£) P^[F„ = rj , n < H B ] and the last sum becomes 

] EE P *> [ Mb < M a ] P l i Y n = V , n < 1 B ] 



A(0 cap* (A, B) 



A(0 ffik, fl ) E E P l = V, n < , o W < H+ o , 



where we used the Markov property in the last step. In this formula {9k : k > 1} 
stands for the group of discrete time shift. Summing over rj the sum can be written 



18 



J. BELTRAN, C. LANDIM 



as 

X(O^P*(A,B) £ P « [YneA,n<W B ,W B o9 n <W+o 8 n ] 

The set inside the probability represents the event that the process Yk visits A 
before visiting B and that its last visit to A before reaching B occurs at time n. 
Hence, since M(£) = A(£)/x(£), since by (|6.9j) cap*(A, B) — cap(A, B) and since g 
is the indicator of the state £, summing over n we get that the previous expression 
is equal to 

1 ../CN r>* I~tut ^ tdt_1 _ (9 ' Jab) I* 



cap* (.4, 5) y tL J cap(A,B) 

By linearity and the monotone convergence theorem we get the desired result 
for positive and then /x-integrable functions. □ 

In the particular case where A = {77} for rj £ B we have that 

ff(7?(s))ds J = cap(M,g) (6 - 13) 

for any /z-integrable function g. This formula provides an estimation for the ther- 
malization. 

Let S be a finite set, let n = {A x : x G S} be a partition of £7, and let £ x be a state 
in A x for each x € 5. For each ^i-integrable function g denote by (g|7r) M 
the conditional expectation of <?, under /j,, given the c-algebra generated by n: 

/ I \ \ - (gl{^})M -, r 4X1 



For each x € S 1 , let 



cap(Cx) := inf cap({r)} , {£ x }) . 



The next result shows that if the process thermalizes quickly in each set of the 
partition, we may replace time averages of a bounded function by time averages of 
the conditional expectation. This statement plays a key role in the investigation of 
metastability. It assumes, however, the existence of an attractor. 

Corollary 6.3. Let g : E — >■ R be a fi-integrable function. Then, for every t > 0, 

\g\i{A"})» 



sup 



E)/ 



xes capfe) 



where \g\(f]) = \g(j])\ f or a ^ V * n E. 

The proof of this result follows from p] Corollary 6.5], formula (|6.13|) and the 
fact that f\ B is bounded by one. 
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